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ENZYME/TAG BINDING AND DETECTION SYSTEM 

FIELD OF THE INVENTION 
The present invention relates to a process for the production and recovery of a 
protein molecule by recombinant DNA technology. An identification peptide and the 
desired protein are synthesized together as a fusion protein that can be purified or 
detected with the use of native or non-active capture proteins. 

SUMMARY OF THE INVENTION 
The current invention describes a method of using a "capture" enzyme to bind 
a specific "tagged" peptide for purposes of affinity purification or detection. If native 
enzymes are used for these purposes, modification to the tagged protein can be 
eliminated by the use of tag sequences that bind but are not processed by the enzyme. 
Alternatively, the enzyme can be chemically or genetically modified so that one or 
more critical amino acid in the active site has been changed, thereby leading to loss of 
catalytic activity. 

The use of enzymes as capture agents offers several advantages not currently 
found within the state of the art. For example, it is possible to select natural or 
recombinant capture proteins with modified binding sites that will have different 
affinities for the same tag. Conversely, it is possible to modify the amino acid 
sequence of the tag to generate high, medium and low affinity peptide tags for use in 
different applications with the same capture protein. Lastly, depending upon the 
application, natural or recombinant capture proteins with increased or decreased 
resistance to denaturation can also be prepared. Such versatility is amenable to the 
development of a variety of standardized binding and elution conditions for the 
isolation of tagged proteins or their complexes. This flexibility can be found in no 
other system available at this time. 
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BACKGROUND OF THE INVENTION 
Rapid affinity purification techniques utilizing immobilized capture agents 
that bind to peptide ligands (tags) biochemically incorporated into the genes of 
interest have become a common methodology used by molecular biologists to rapidly 
purify recombinant proteins. Advantages of these techniques are that they are 
universal (i.e., a peptide can easily be incorporated into any gene through 
recombinant technology) and easy to use (simple equipment: batch or gravity column, 

simple elution protocols). 

There are many different systems currently in use to purify tagged proteins. 
The most popular system used today for such affinity purification is immobilized 
metal affinity chromatography (1MAC), wherein a repetitive sequence of histidine 
residues (usually 6) binds to a metal ion complex (usually nickel) that is attached to a 
resin. Examples of these systems are the QIAexpress™ system from Q.agen and the 
Talon™ system from Clontech. These resins are easy to use, have a high binding 
capacity, and can be regenerated for repetitive use. However, binding and elution 
from these supports have low selectivity and many non-tagged proteins co-purify 
with the tagged protein. In addition, the recovered protein contains high 
concentrations of imidazole and nickel salts. This method has one of the poorest 
purification efficiencies, which limits the utility of this method for purifying 

20 macromolecular complexes. 

The second most commonly used systems are antibody-based methodologies 
such as the FLAG system from Sigma or the HA system from Roche Molecular 
Biochemicals. In these systems, monoclonal antibodies raised against peptide 
antigens are used to capture the specific tagged proteins. After washing to remove 
the untagged contaminants, the tagged proteins are recovered by the application of 
any of a number of specific and nonspecific elution agents. The selectivity of these 
systems is much better than the IMAC system described above. However, the 
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antibodies used in such systems are very costly to manufacture and are easily 
denatured and inactivated when exposed to extremes of pH or chaotropic agents. 

The next class of systems uses proteins as the tags and immobilized cofactors 
or organic ligands as the capture molecule. The two most used systems in this class 
are the Glutathione-S-transferase (GST) system of Amersham Pharmacia Biotech and 
the Thiofusion expression system of Invitrogen. The advantage of these systems is 
the low cost and durability of the capture agent. However, the large size of the tags 
(greater than 20,000 daltons molecular weight) often has a significant negative impact 
on the proper folding and functioning of the recombinant protein. 

The last tag purification technology to be discussed is the carboxyl terminal 
lysine/arginine peptide purification system. Although not originally designed to 
purify tagged proteins, under the right conditions recombinant proteins can be created 
and purified. In this system, anhydrotrypsin (trypsin in which the active site serine 
residue has been chemically converted to dehydroalanine) is immobilized onto a 
support and used to capture peptides having carboxyl terminal lysine or argimne 
residues. Although this system uses one of the capture agents that we will describe 
later, the mechanisms of binding and release of the tagged proteins are totally 
different (see Table 1). 

TABLE 1: Differences Between The Carboxyl Terminal Lysine/Arginine Purification 

Method And The Present Method. 
™ T .vsine/Areinine Method Present Method 



Type of tag sequence 


Lysine/Arginine Peptides 


Substrates/Inhibitors 


pH binding profile 


Maximal 5.5 to 6.5 


Maximal 7.0 to 8.0 


Elution with Benzoyl Argimne 
or Hippuryl Arginine 


Yes 


No 


pH 3.0 elution 


Yes 


Yes 


Internal Lysine or arginine 
peptide binding 


No 


Yes 


Strength of Binding 


Weak 


Strong 


Nature of interaction 


Enzyme/Product 


Enzyme/Substrate 


Binding denatured samples 


Yes 


Yes 
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In comparing the present method with the lysine/arginine method, it should be 
noted that the two properties these systems have in common are characteristics of the 
anhydrotrypsin molecule and not the tag. The literature that demonstrates the 
carboxyl terminal lysine and arginine peptide binding repeatedly teaches that internal 
(i.e., non-terminal) lysine and arginine sequences do not bind strongly to the 
anhydrotrypsin matrix. This view is probably a result of the focus of the studies (the 
recovery of carboxyl terminal lysine and arginine peptides) and not on characterizing 
the properties of the anhydrotrypsin molecule. Our findings on the binding and 
elution characteristics of native and denatured trypsin inhibitors to anhydrotrypsin 
were totally unexpected from the current literature and generally accepted theories. 

All of the existing tagged affinity purification systems have one or more 
deficiencies. In some of the existing systems the binding resin can only be used for a 
few cycles of purification, for others the binding capacity of the resin is very low, 
while other systems are unable to purify proteins as part of a macromolecular 
complex. Some of the current systems have low purification efficiencies and in 
others the affinity resin is very expensive. In addition, some of the systems require 
very large tag sequences that can have a negative impact on the recombinant protein 
structure and function. The majority of the protein-based isolation systems employ 
capture proteins that are not resistant to denaturation and thus have limited utility in 
the purification of denatured or aggregated protein complexes. The apparent reason 
for this is that these proteins were chosen based on their unique binding 
characteristics and not on their resistance to denaturation. 

In contrast, many of the enzymes of the present invention function in adverse 
environments, are resistant to denaturation, and therefore are ideally suited for use in 
this application. The invention described in this document uses small substrate or 
inhibitor polypeptide tags that bind to natural, modified, altered or mutated enzymes 
to purify tagged recombinant proteins. The diversity of the proteins that can be used 
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for this invention and their general resistance to denaturation makes them ideal 
binding proteins for this application. 

The present invention offers significant advantages over previously used 
affinity systems. In the present invention, the binding that is present between the 
enzyme and the target peptide sequence (usually a sequence derived from an enzyme 
substrate or inhibitor specific for the active site of the enzyme) can be selected to 
offer a wide range of binding constants. Thus, tags can be designed to have the 
desired binding and elution characteristics. Within the super family of all enzymes, 
the subfamily of proteolytic enzymes is especially well suited for our needs. Most of 
these proteins are small single chain molecules with molecular weights in the range 
of 1 5,000 to 30,000 daltons. They tend to have very compact structures and a 
propensity to refold back to their native structures. Lastly, proteolytic enzymes tend 
to be more resistant to denaturation, and thus are useful under conditions not suitable 
for many current affinity purification systems. 
! 5 The present invention is based on the concept of using protein-protein 

interactions as the basis for affinity purification. In the most general case, the 
topography of one protein is complementary to that of the other and the two interact 
to form a non-covalent complex. This is the basic mechanism of antibody-antigen 
interactions. However, this type of protein-protein interaction is rather general, and 
20 can be found in many biological interactions. The activities of many enzymes are 

controlled by protein-protein interactions. One aspect of this invention, described 
herein, is a protease and protease substrate system where a protease is a protein that 
fragments other proteins or polypeptides and a protease substrate is a protein or 
polypeptide that specifically binds to the active site of the protease. Inhibitors are a 
25 special subset of these substrates. These polypeptide inhibitors can have a high 

affinity for the enzyme active site and through any of a number of mechanisms form 
a complex that is inactive. In this aspect of the invention, the enzyme (either active 
or chemically/genetically inactivated) is attached to a solid support to create an 
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affinity matrix. Using molecular biology techniques, the peptide sequence that bunds 
,o the active sue of the enzyme is cloned into the gene of interest and this genettc 
construe, ,s expressed ,„ an appropnate protein express™ system. After expresston. 
the tagged protein is recovered by passage of the en.de matenal over a matnx 
contatning the immobilized enzyme. The tagged recombinant protein w„l bod 
strongly to the immobilized enzyme while the other proteins will be washed away. 
The tagged protein is then recovered by more stringent elution condmons. 

DETAILED DESCRIPTION OF THE INVENTION 
In describing the present tnvention, a "natural protein" is defined as a 
pol ype P ,,de with a defined ammo aeid sequence that is capable of being produced ,n 
a living ceU. The gene cod.ng for this protein will be found as a normal component 
of the DNA of a cell or an infecting bacteria or virus. The proteins will have one or 
more defined three-dimensional structures and have one or more defined or unknown 
functions. Our interest is with proteins that are enzymes that interact w,th other 
polypeptides that are substrates, co-factors or inhibitors. Isoenzymes and genetic 
polymorphisms are considered natural enzymes. 

A "modified enzyme" is a protein in which one or more amino actd restdues 
in the active site have been chemtcally or genetically modified to eliminate the 
enzyme's catalytic function while maintaining its ability ,0 bind substrates, co-factors 
and inhibitors. These modifiers include chemical modification of amino actd stde 
chains as well as genetic substitution, insertion and deletion of amino acid restdues. 

An "altered enzyme" is a protein that has been chemically or genetically 
modified a. a nonca.alytic residue. These modifications include chemical 
modification of amino acid side chains as well as genetic substitutions, insertions and 
deletions of amino acid residues. These alterations can have a variety of effects, and 
can be neutral with minima, effect on affinity, specificity, activity or stability (neutral 
modification,. These substitutions can affect the stability of the protein with respect 
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, 0 _en,a, face. Son, — of*. factors ^de temperature pH 
sa „ a „d dena.urmg agents. Other substitutes could have an effee, on ,he catalyttc 
siK and lead to changes in affinity, specificity and actm.y. 

,„ one embodiment, the present invention ,s a system used for affimty 
punficatio, Th,s system composes two elements pnmanly. The firs, element ,s the 
Vanture reagent" and the second is the "tag." 

ta g ed protem can be accomplished by either modifying the peptide science < h 
i that btnds to tbe active site or by using a protein as a capture agent wh.ch ,s 
able to bind but not modify the tag. 

The former optton can be accomplished if the molecular mechantsms of 
catalysts and the acttve she topography are we,, understood. For example, a number 

be used rn thetr natural or mod.fied form for this application. The firs, group that 
1 to tmnd are the proteases such as the serine, sulfhydry, and acidic protease, 
A „of,besepr„,eases„„.y use thetr s.de chain ammo actds to create therr actrve sttes. 
tjs.ng site-directed mutagenesis techn,,ues, these enzymes could he modtfied 

the active site senne residues can be replacedby alanine. In a s.mtlar way the 
coul Lso bereplaced with alantne or serme restdues. Finally, acidtc proteases* 

serine or asparagine residue. 



7 



Docket No. BM1D9809 US 



For those systems using active enzyme as the capture agent the 
preparation of the solid phase absorbent is straight forward. Using any of a variety of 
solid phase supports, the active natural or recombinant enz^e is immobthzed using 
a variety of linkage chemistnes to prepare the capture matrix. Such immobthzatton 
procedures are well known in the art. The only caveat to this process ,s that care 
must be taken no, to immobtlize the capture enzyme with an amino acid resrdue that 
is ,„ or near the active site of the pro.em. If thrs occurs the capture molecule wrll 
have either reduced capacity or affimty for the tag. Bovine trypan is known to lose 
acivt.y upon immobilization unless an inhibitor like benzathine is used to protect 
,he active site. Such concerns are well known to the art when proteins are covalen.ly 



linked to other molecules. 

For embodiments using capture enzymes lacking proteolytic activity, the 
process is often a bit more complicated. As an illustration, the gene for the 
proenzyme trypsinogen can be modified using site-directed mutagenests methods ,„ 
change the active st.e sertne restdue at positton 183 to a.antne. Pubhshed data show 
,ha, chemical modtfication of this restdue does no, distort the active site, and normal 
L-alanine is expected to allow for proper folding. In a similar way, Ac gene for 
enterokmase (more accurately renamed euteropro.ease) light chain can have the 
active site serine residue a. position 1 87 changed to an alanine residue. In Cher , 
a genetic construct can be made containing the gene for the specific pro.em. The 
protein can then be expressed in an appropnate protein expresston system. After 
synthesis, the modified recombinant trypsinogen or enterokinase can be recovered 
any of a number of chromatographic techniques. If necessary, the capture pro.em ._ 
be enzymatically processed to its final form and immobilized to a number of d.fferen. 
supports and used to isolate the specific tagged recombinant protetn. 

A natural protetn that can be used as the capture molecule is human hepann 
binding protein (CAP 37/Azurocidin). This protein is a member of the sertne 
protease famt.y bu, the active site triad restdues have been mu.a.ed to yield a protem 
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wit h an e,as,ase-„ke structure bu, w,,h no enzyme ac,,vi,y. However «h,s pro,* 
1S m capab,e of bmding strongly «o trypsin mhibitors and as such would make an 

excellent capture molecule. 

A ,arge number of opbons are available ,o develop .be tag pept.de sequence. 
P„r methods of the present ,nven.,on employing active enzyrrte capture agents, ««e 
a.eat.asttHreestrate.eswb.cbeanbeused.Combinatonalcbem.stryorsynbe 

peptide synthests techniques can be used to generate a large number of pep.tdes. he 
elative bmdhtg strengths of each peptide to the binding pro,e,„ and ,«s res.stance to 
modtfrcatton is eastly de.en.ined. Once sequences with .he desired characters 
have been MM <hey - be tested for their effectiveness .0 punfy a tagged 
recombinan. pro.etn. Thrs would be a "sbotgun-.ype" (i.e., broadly focused, 
technique and could be applied to any enzyme. 

A more focused approach would be to star. w,«h a sequence havmg a known 
affinity for .he enzyme and .hen ei.her usmg s,,e dtrec.ed mutagenests or chem.cal 
syll .hesis to create analogs tha, could be evaluated for thetr characteristtcs. As an 
elple, for proteases, one could use published da.a on .he amino acd sequences of 
pro.e se tnbtbttors. The basic pancreatic .rypsin inb.bi.ors are known to have very 
,gh bmding constants and res,,ance .o proteolysis. Detatled eva,ua.,on of homologs 
nfthese sequences can lead .„ .he deve.opment of pepfdes .ha. bind .o the en Z y.e 
but are no, cleaved. In another approach, one could ]U s. use .he en.ire ,„h,b,„r 
mdecule or a d.sulftde bndged bmding domain of an .nhibi.or wi.h .he des,red 

characteristics as the tag. 

For methods of the present invention using cap.ure enzymes wh.ch have been 

mod.fied so tha, they can bind bu, no, modify the tag, the same types of 
character,^ could be performed, bu, ,he only data of interest would be the 
bindmg affinity and release cnteria for the tag sequences. In summary, the use of 
enzymes to isolate tagged recombinant protems offers a valuable too, ,o meet the 
practical application of rapidly isolating recombinan, protems. 
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Once prepared, the present invention can also be used for the detection, 
localization or quantitation of the tagged proteins. Similar technologies have been 
extensively developed using monoclonal antibodies as the binding partner lor the 
identification of tagged proteins. Like those technologies, the present system has all 
of the properties required for detecting, localizing or quantitating tagged proteins. In 
its simplest form, a reporter molecule is attached to the capture protein and the signal 
from this molecule is used to obtain the desired information. In general, any label or 
reporter molecule which can be incorporated into an antibody can be incorporated 
into our enzyme system. A representative but not exclusive list of labeling 
technologies includes direct labels such as spectrophotometry or fluorescent 
molecules as well as amplification reagents such as biotm or enzymes. 

The following paragraphs will provide a more detailed discussion of the 
interactions of trypsin and trypsin inhibitors with their implications for a tagged 
peptide purification system. Although there are many different families of trypsin 
inhibitors, their mode of action is believed to follow the same basic mechanism. 
Trypsin inhibitors contain disulfide bonds and these bonds are considered essential 
for inhibitor function. Polypeptide trypsin inhibitors inhibit in a competitive manner. 
Trypsin inhibitor that has had its disulfide bonds oxidized loses its ability to inhibit 
trypsm During the inhibition process a single peptide bond is usually proteolytically 
cleaved in the inhibitor binding sequence and both the native and cleaved inhibitor 
can bind to trypsin. These facts led to the currently accepted theory of the 
mechanism of trypsin and trypsin inhibitor interaction. 

Under this accepted theory, the trypsin inhibitor disulfide bonds are critical to 
lock the inhibitor sequence into the proper conformation so it can properly fit into the 
trypsin active site in the typical "lock and key" model of enzyme-substrate 
interaction. The inhibitor is fragmented at the specific bond and the linked inhibitor 
chains separate slightly and catch themselves into the trypsin active site. This 
increases the binding energy of interaction, reducing the dissociation constant and 
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blocking the active site of the trypsin. The interaction is non-covalen. and 
dissociation and rcassociation can occur. This theory is supported by enzymattc, x- 
ray crys.allographic and physical biochemical findings and for the most par. ■« seems 

to explain the interaction. 

We determmed that reduced, alkylated and denatured trypsm inhibitors bind 
to anhydrotrypsin with about the same affinity as the native inhibitor proteins. This is 
a significant and unexpected observation. These results are consistent with the 
conclusion that the primary structure of the trypsin inhibitor and the topography of 
the trypsm active site are the determining factors for the binding constant of the 
fraction. In addition, it appears that the function of the disulfide bonds is to hold 
the proteolyzed peptides of the trypsin inhibitor together and thus reduce the 
dissociation constant. Even when the proteolyzed inhibitor dissociates from the 
enzyme it can still reassociate with the enzyme and inhibit its activity. In the 
inhibitor without the disulfide bridges, once proteolysis occurs, the peptides can 
individually dissociate from the enzyme. Once separate in solution they are not able 
to interact with the trypsin as an inhibitor, and instead become simply another 
substrate. This then strongly shows that modified proteins like anhydrotrypsin can be 
used as a capture agent and that trypsin inhibitor active site sequences without 
disulfide bridges can be used as tags. 

In higher animals, especially within the pancreas, there exist at least two 
different families of protease (trypsin) inhibitors. These inhibitors are generally 
classified as the Kunitz and secretory families of inhibitors. The Kunitz inhibitors are 
generally intracellular. They form very strong 1 : 1 complexes with trypsin that have 
dissociation constants reported to be as low as l(r" moles/lite, This is a very low 
dissociation constant and is within the dissociation range of the strongest non- 
covalent interactions. This inhibitor is usually not proteolyzed by the enzyme. Upon 
binding, these inhibitors permanently inactivate the enzyme. These proteins are 
about 60 amino acid residues long with three disulfide bridges. The inhibitor 
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sequence is found on the second disulfide loop with the ammo acids which bind to 
the cleavage site (lysine or arginine) being on the carboxy. s,de of the cysteine 
residue. The general sequences of these inhibitor sequences are as follows: 

-C- X-Y-Y-Z-Z- 
wherein 

C = Cysteine 

X = Lysine or Arginine 

Y = small or polar amino acid 

Z = Hydrophobic or aromatic amino acid (Formula 1). 

The consensus sequence for this class of inhibitor is as follows. 
-Gly-Pro-Cys-(«^ (SEQ ^ " 

The secretory trypsin inhibitors in the pancreas are secreted into the 
pancreatic duct along with the zymogen proteases. Upon binding, these inhibitors 
generallypreventtheprematureactivationoftheenz^es. These inhibitors have a 
mU chwea k erbinding constant than the Kunitz inhibitors and the active bond of the 
inhibitor, usuallyfragmenteddunngtheinactivation^ 

eventually be degraded by the trypsin. These proteins have a structure similar to the 
Kunitz inhibitors. They are also about 60 amino acid residues long and contain three 
sim ilar disulfide bridges. The inhibitor sequence is also found in the second disulfide 
loop. The structure in this loop is similar to the Kunitz sequences, and is represented 

by the following: 

-C-A-X-Z-Z-Y-B -Z- 

wherein 

C = Cysteine 

A = variable ammo acids (usually threonine or proline) 
X - Lysine or arginine 
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Y = small or polar amino acid (usually asparagine or aspartic acid) 
B = Hydrophobic (usually proline) 
Z = Hydrophobic or aromatic amino acids (Formula II). 

The consensus sequence for these proteins is as follows: 
- ( A S n/Gly)-Gly-Cy S -(Pro/Thr ) -(Lys/Arg ) -Ile-Tyr- ( Asn/A S p)-Pro-Val-C y s-(SEQID 

NO: 2). 

The snake venom protease inhibitors are variations of the Kunitz pattern. 
They tend to be small proteins of about 60 amino acids with three disulfide loops. 
They have active site sequences with the following pattern: 
-C-X-Y-B-Z-Z- 
wherein 

C = Cysteine 
j 5 X = Lysine or Arginine 

Y = small amino acid (usually alanine) 
B = variable amino acid (usually polar) 
Z = Usually hydrophobic (Formula III) 
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The consensus sequence for these proteins is as follows: 
.Gly-(Arg/Leu)-Cys-Lys-Ala-His-Ile-Pro-Arg- (SEQ ID NO: 3). 

Plants like animals, have a number of families of protease inhibitors. The 
two most common are the Kunitz and the Bowman-Birk families. The Kunitz family 
is a high affinity protease inhibitor that is about 1 90 amino acid residues long with 
two disulfide bndges. The inhibitory site is located in the middle of the first large 
loop. The overall inhibitor site has the following general structure: 
-B-A-X-Y-Y-Z-Z- 
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wherein 

X = Usually Arginine 
A = Usually hydrophobic 

Y = Variable usually small (alanine and threonine) or polar 

Z = Usually hydrophobic (isoleucine, leucine, phenylalanine or tyrosine) 

B = Variable (proline or arginine most common) (Formula IV). 

The consensus sequence is more variable: 
.(Argrt»ro>(Leu/Pro)-Arg.(Ile/SerMThr/Arg)-Phe.l 1 e-Pro.Asp- (SEQ ID NO: 4). 

The Bowman-Birk inhibitor family is similar to the animal secretory family 
of inhtbitors. The proteins are small (70 to 80 amino acid residues) with around 
seven disulf.de bridges. The mhibitors are double-headed with both trypstn and 
chymotrypsin inhibitory sites. The inhibitor sequences are on very ttght dtsulfide 
, 0 „ps. The dissociation constants for the trypsin-inhibitor complexes is between 10 
to 10" 10 M. The general sequence is as follows: 
_C_A-X-A-Y-Z-Z-B-C- 

wherein 

C = cysteine 

A = polar amino acid (usually threonine or serine) 
X = usually lysine 

Y = variable amino acid (usually isoleucine) 
Z = hydrophobic amino acid (usually proline) 
B = polar amino acid (Formula V) 

The consensus Bowman-Birk Inhibitor sequence is as follows: 
-Cys-lle-Cys-Thr-(Lys/Arg)-Ser-lle-Pro-Pro-Gln-Cys- (SEQ ID NO: 5) 
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The last farmly of inhibitors is the bird egg white trypsin inhibitors. Bird egg 
protease inhibitors are commonly referred to as "ovomucoids." These proteins are 
some of the more heavily glycosylated proteins found in egg white. They are 
members of the Kazal family of protease inhibitors, and exhibit a very high homology 
to the secretory mhibitors of other animals. The general sequence is as follows: 

-C-A-X-Z-Z-Y-B -Z- 

wherein 

C = Cysteine 

A = variable amino acids (usually proline) 
X = Lysine or arginine 

Y = small or polar amino acid (usually serine or lysine) 

B = Hydrophobic (usually proline) 

Z = Hydrophobic or aromatic amino acids (Formula VI). 

The consensus sequence for these proteins is as follows. 
-Val-Ala-Cys-(Lys/Arg)-Ile-Leu-(Ser/Lys)-Pro-Val-Cys- (SEQ ID NO: 6). 

Several things should be noted in conjunction with this information: 

a) From X-ray crystallographic data the amino acids on the amino terminal 
side of the active lysine or arginine residue do not significantly contact with the 
active site of the trypsin. Therefore, only the Lys or Arg residue along with the 
following five or six amino acid residues will bind into the active site. 

b) In general all of the inhibitor sequences have a relatively similar 
distribution of residues in their sequence. Following the lysine or arginine residue 
there will be two amino acids that are usually small or polar. Basic amino acids are 
often seen in the second of these two positions. The next two positions are almost 
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always hydrophobic. Only the secretory inhibitors do not follow this pattern but their 
function is different from the other proteins. 

c) Also, our reduction alkylation experiments demonstrated that the cysteine 
residues do not contribute significantly to the binding and so for these sequences this 
amino acid residue could be replaced. 

There are several possible inhibitor tag sequences that constitute specific 
embodiments of the "tag" sequence useful in the present invention. 
Gly-Pro-Ser-Lys-Ala-Arg-Ile-Ile-Arg-Tyr Bovine BPTI 
Ser-Pro-Tyr-Arg-Ile-Arg-Phe-Ile-Ala-Glu Soybean Kunitz 
Ala-Ser-Thr-Lys-Ser-Asn-Pro-Pro-Gln-Ser Soybean Bowman-Birk 
Gly-Arg-Ser-Lys-Ala-His-Ile-Pro-Arg-Phe Sand Viper venom 
Gly-Ser-Pro-Arg-Ile-Tyr-Asn-Pro-Val-Ser Bovine secretory 
Val-Ala-Ser-Arg-Ile-Leu-Ser-Pro-Val-Ser Chicken Ovomucoid domain 3 
Val-Ala-Ser-Arg-Ile-Leu-Leu-Pro-Val-Ser Chicken Ovomucoid domain 4 
(SEQ ID NO: 7 through SEQ ID NO: 13, respectively). 

When used as a tag, these sequences (like any of the tag sequences of the 
present invention) can contain greater or fewer amino acids than those listed above, 
so long as the binding activity with the capture enzyme remains sufficiently strong. 

With regards to these inhibitors, there appears to be at least two mechanisms 
of inhibition. With the secretory inhibitors there are two hydrophobic amino acid 
residues (usually leucine and isoleucme) on the carboxyl side of the active lysine or 
argimne. It appears that this hydrophobic region protects the sensitive amide linkage 
by increasing the energy of activation for the formation of the tetrahedral 
intermediate. This would then slow down the kinetics of hydrolysis. It is well known 
from protein sequence studies that the presence of multiple hydrophobic residues on 
the carboxyl side of lysine and arginine residues greatly reduces the kinetics of 
proteolytic cleavage. 
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The Kunitz inhibitors seem to utilize a different mechanism for inhibiting the 
enzyme. These inhibitors have amino acid sequences, located at the carboxyl side of 
the sensitive bond, that have a very high binding constant to the enzyme. It seems 
that once binding occurs, these sequences have a very low rate of dissociation. Thus, 
these structures seem to inhibit the enzyme by increasing the energy of activation for 
the dissociation of the product-enzyme complex. The venom inhibitors seem to be a 
variation on the theme of the Kunitz inhibitors. 

There are several means of eluting the tagged protein from the immobilized 
capture agent. We have demonstrated that lowering the pH to a value of about 3.0 is 
sufficient to elute the tagged protein. This is a standard method for the dissociation 
of trypsin inhibitor from trypsin or anhydrotrypsin. These conditions are known to 
cause a mild reversable denaturation of the trypsin active site. An elution buffer that 
we have found to be very effective is 50 raM glycine, 10 mM calcium chloride pH 
3.0. Another method of elution useful in the present invention is to use native trypsin 
inhibitor as the elution agent to compete for the binding site of the enzyme. Bovine 
basic pancreatic trypsin inhibitor (BPTI) is an example of a native inhibitor that is 
useful for elution. It is a small peptide (about 70 AA long) and has a very high 
binding affinity for trypsin. Depending on the specific tag that is used, it is possible 
to use many to most of the different trypsin inhibitors. This elution reagent could be 
used at any pH but has its strongest binding at values between 7.0 to 8.5. 

Another alternative is to use a synthetic peptide from the inhibitory sequence. 
This would require a seven to eight residue peptide that would be small enough for 
easy removal. Again elution conditions would be similar to those for the native 
inhibitor. 

Another possibility would be to use low molecular weight competitive 
inhibitors such as benzamidine. This would have an even lower molecular weight 
and would probably be the least costly. Because these inhibitors have a lower affinity 
than the natural peptide inhibitors, higher molar concentrations would be required. 
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,, should be noted that the binding of the trypsin inhibt.or sequences used ,n 
,he present invention to the antitrypsin has a very different mechanism of binding 
and elution from that of polypeptides containing carboxyl terminal lystnc and 
argmine residues (see Table 1). First, the optimal binding pH values are dtfferen, 
with the present tnhibitor sequences binding stronger at neutral to slightly alkahne (7 

8 5) and the carboxyl terminal lysine and arginine peptides binding stronger at 
slightly acidic values (5.5 - 6.5). Ftnally the carboxyl terminal peptides can be elated 
from the anhydrotrypsin by low molecular weight arginine peptides (htppuryl 
arginine and benzoly arginine) while the present inhibitor sequences are not. The 
binding and elution characteristics of these structures are totally different and seem to 

follow distinct mechanistic pathways. 

While we have studied the protease-protease inhibitor system in great detail, >t 
is „„, the only possible example for this system. There are many additional enzymes 
that could be used as capture molecules. Two additional enzyme families that could 
be used as capture molecules are kinases and glycosylating enzymes. The kmases are 
a large family of proteins whose function is to place a phosphate group on spectftc 
senne, threonine or tyrosine residue, Some of these enzymes only recogntze very 
specific sequences and could perform as very high stnngency capture agents. Thts 
family of protetns can be divtded into a number of sub famtlies of enzymes based 
upon the sugar chain that will be covalen.ly linked and the specific amino acd betng 
modified One of the best charactenzed of these enzymes is the asparagme - htgh 
mannose glycosylating enzymes. These proteins transfer a preformed high mannose 
carbohydrate structure to specific asparagine residues. These enzymes only 
recognize asparagine residues in specific exposed sequences which have the structure 
Asn-Xaa-(Ser/Thr)-, where Xaa can be any amino acid so long as the sequence ,s 
sufficiently solvent exposed to fit into the enzyme active site. Modification of the 
carbohydrate binding site or the residues responsible for the transfer of the 
carbohydrate chain onto the asparagine residue could convert this enzyme into a 



18 



Docket 



No. BMID ( )S()9US 



ags indrnduats fa mi ,iar with the art can identify ntany nrore exanrples of en >™ 

be used as tags. f 
The present inventton ptovides an efficten, process for the recovery of 

en^e capture protet. The tag can be a natura, or sy„.net,c potypept, e subarate, 
JL cofactor or effector anr.no ac,d seo,ence. The capture tncdecuie ,s a 
Lyme protei^thathasahindtn.s.tefort.tetagscuence. Thts capture rnoiecuie 

che nicaiiy) so as to s,,,. be abie to bind the tag sec,ue„ce but no. mod.fy , A large 
riroLodintentsofthisconceptareposstbieandseveraiwmbedescrtbed 

below. 

EXAMPLES 

^ One entbodtnren, of thts tnventton envoys a modtfted trypstn as the capture 

protetn and the acttve sconces of trypstn irWbitors as the tags. The ,yp,n 

L,ecu,e (either natura, or —an,, has an active site amtno ac, - 

th a, the protein is ,i.l capab.e of binding the tag (tnhibttor, bu, ts not capabie of 
X^^entatton.Anhoughanu.berofresiducs.ntheacttves.teoftryps.n 

is respons.ble for peptide bond hydrops (,n bov.ne trypstn tbts ,s restdue 77> A 
Zer of pubiications have reported methods to acconrphsh «h,s ntod, cat.on bu, 
rlhodoftshtietaiaMethods in En^.ogy 3,8 - 3S3, was found tobethe 

fol ,ows. T^n ,s fust reacted with phenyintethyisuifony, chionde ( PMSP, to font. 
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,„en treated w,,h a.kal, to induce a beta elimination of ,h= modified senne res.due 
wW. the formation of a dehydroalanine residue. The dehydrated trypsm 
<anhydro,ryps,n) ts then punfied on an argtnine affim.y resin. The final produc, „ a 
pr „,e,„ with the active site of ,ryps,n but without the ability to fragment pep.tde 
bonds The anbydrotrypstn can be attached to a variety of sohd supports and used as 
an affinity matnx. Within the h.erature a number of these supports have been 
prepared and the,, abtl.ty to bind carboxy. terminal ,ysine and argtnine peptides has 
been characterized and is well-known in the art. 

We have evaluated the binding and release of native and denatured trypsm 
inhibitors to the immobilized anhydrotrypstn as a model to characters the 
purtfication of tagged proteins. First we de.en.ined the pH prof,, of bind.ngof the 
tr yps,n .nhtbitors to the anhydrotrypsin solid support. What was observed was a 
profile tha, matched the activity vs. pH plot of native trypsin w,th maxtma, b.ndmg 
b e,„g observed between pH values of 7.0 to 8.0. This is m contrast to the carboxy. 
.ermmal lysme or argintne peptides tha, have a pH maxtmum of bmding around pH 
5 0 to 5 5 Also, the carboxyl terminal lysine and arginine peptides can be elu.ed by 
orgamc denvatives of argmtne while these reagents do no, e.ute trypsm tnh,b,tor 
sequences. The denotation of ,he trypsin inhibitors with urea and guamdtne 
hydrochloride shows little loss of bmding capacity and this seems follow the 
denature of trypsin by these reagents. Lastly, completely denatured, reduced and 
carboxymethylated trypstn inhibitors were found to bind a, leas, as well as the native 
proteins to the anhydrotrypsin in buffers with and wi,hou, urea or guan.dme. Th.s 
bmding of the denatured trypsin inhibuors ,0 the anhydrotrypsin support prov.des 
strong ev.dence tha, the specific bmding of ,hese pro.ems is driven by ,he pnmary 
sequence of the inh.bitor binding regron and no, by its conformation and presentation 
to the trypsin molecule. 
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u wh-litv Samples of this resin were repeatedly exposed to denaturing 

of binding capacity. Also, umum& p 

S r„H 2 5 to 3 0) Samples of the resin have been kept at + 2 to 8 C 
acidic conditions (pH 2.5 to *.v). v 

to g ne for .rypsin and rep.ace ,He ac.ive s,.e senne r es,due ano.ner 

structuI a, and steric Condons i„d,ca t e ,ha< an a,a„,ne res^e » a preferr 

te re are many chymolryrsin inhibi.o, sequences avadaMe. 
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a 11 nf these enzymes use the same active 
homologies to trypsin and chymotryps.n. All of these enzym 

site residues to fragment their substrate moleeule. 

Euterokinase is an intestine mucosal enz^re to recognizes the ammo 

L to secuence. The key segment of thrs se q uence ,s four aspartrc actd restdues 
foho.edhyal.srneresrdue.Entero.naseeleavesthepep.rdehondat.ecarho.i 

sM e of the lysine res.due. ,n one embodrment, the aetrve site serine resrdue can 
modified as reported above for trypsin and chymotrypsm. 

Enterokmase can be isolated from intestinal mucosa or may be produced 
synthetically or by recombinant technroues. In one embodrment, the actrve site senne 
(residue . 87 of the mature protein) ts converted to an a,anine residue to 
protern to can bmd its substrate but no, proteolytic* fragment the molecule. 
Other substitutions withm the active site are possrble bu, from our investrga ons and 
to of others it seems to the senne resrdue has the leas, influence on substrate 
Onhrbitor, brndmg. An mterestmg obse.ation ,s to most plasmids ,ha„nc ude a 
restriction protease cleavage stte use the enterokmase se q uence ,0 allow for tag 
removal. Thus, for this system, there would be no need for the preparatron of new 
vectors for the expressron of the tagged recombinant protems. 

Another example of enzyme-substrate combinations useful in the present 
mvention rs factor Xa and its cleavage peptide. Factor Xa is also a senne protease 
a „d one of the enzymes ,n the clottmg pathway. Factor Xa, like enterokmase, as 
very stringent seouence retirement and wil, only cleave a peptide ^< 
arginine of the sentence tso.eueine - glutamic acid - glyeme - arginme. As wrth *e 
examples above, the acttve site senne resrdue of factor Xa can be modrfred by e„h r 
cheml, or genettc means to y,e,d a capture protein without the abihty to fragment 
the polypeptide chain. The factor Xa cleavage site is the second most popular 
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cleavage site to be genetical.y engineered into vectors for cloning and so .here are a 
,arge number of tagged protems for which this capture agent would be of value. 

Example 3 

An additional preferred embodiment of this concept would be to use a 
modified psychrophilic enzyme as the capture protein. Psychrophi.ic enzymes are 
proems adapted to function at temperatures approaching 0»C and are usually found 
m n „„.warm blooded organtsms in the Arctic and Antarcttc. The ability to function 
a, low temperatures comes a, a cost. These proteins have reduced therma, stabth.y. 
The trypsin isolated from North Atlanttc cod has been shown ,0 thermally denature a, 
,5 »C lower temperature than bovme trypsin and the trypstn front the Antarcttc fish 
Paranotothenia ntagellanica ,s macttvated a, temperatures above 30°C. The genes for 
these proteins have been cloned and sequenced. These proteins show htgh homology 
,0 other fish and mammalian tryps.ns with unique eold adaptations. Like the capture 
„ described in example 1 , these enzymes could be chemically modified or 
thetr genes cou.d be modtfied to convert the active site senne residue to an alan.ne or 
other inert amino acid. These proteins would be immobilized and used as capture 
agents as described in example ! . The only difference is tha, elu.ion could be 
performed by just ratsing the temperature. From a production and handhng 
standpoint there would be significant advantages to such a system. 

The present invention has been detailed both by direct descrtptton and by 
example. Equivalents and modtfiea.ions of the present invention will be apparent to 
those skilled in the art. and are encompassed wtthin the scope of the invention. 
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